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1 <110> APPLICANT: Doutriaux, Marie-Pascale ^ n 

2 Betzner, Andreas /7f^t^~ \ ^ 

3 Freyssinet, Georges f * 

4 Perez, Pascal 

5 <120> TITLE OF INVENTION : METHOD FOR OBTAINING PLANT VARIETIES 

6 <130> FILE REFERENCE: A33153 -PCT-USA 072667,0128 rjCTOClX/Pn 
(fll/-? 7 <140> CURRENT APPLICATION NUMBER: US/09/529 , 239B REOtl V £LJ 

8 <141> CURRENT FILING DATE: 2000-10-27 

9 <150> PRIOR APPLICATION NUMBER: PCT/EP98/06977 grj £ 5 2002 
10 <151> PRIOR FILING DATE: 1998-10-09 



11 <160> NUMBER OF SEQ ID NOS : 103 



ERRORED SEQUENCES 



TECH CENTER 160012900 



153 <210> SEQ ID NO: 12 - ~ 

154 <211> LENGTH: 1250 /yw^^JU^i Kul**4a^ I J emifity 

155 <212> TYPE: DNA //T^_ . [ 

156, .<213> ORGANISM: Arabidopsis thaliana ecotype Columbia ^\^^l2-0~7 

157 <223> OTHER INFORMATION: Clone 52 / 

Q|£> 158 <400> SEQUENCE: 12 ^22/7 

159 cccgggatgg gcaagcaaaa gcagcagacg atttctcgtt tcttcgctcc caaacccaaa 60 , 

160 tccccgactc acgaaccgaa tccggtagcc gaatcatcaa caccgccacc gaagatatcc 120 

161 gccactgtat ccttctctcc ttccaagcgt aagcttctct ccgaccacct cgccgccgcg 180 0^^2-23? 

162 tcacccaaaa agcctaaact ttctcctcac actcaaaacc cagtacccga tcccaattta 240. 

163 caccaaagat ttctccagag atttctggaa ccctcgccgg aggaatatgt tcccgaaacg 300 

164 tcatcatcga ggaaatacac accattggaa cagcaagtgg tggagctaaa gagcaagtac 360 AjLot^j^ 

165 ccagatgtgg ttttgatggt ggaagttggt tacaggtaca gattcttcgg agaagacgcg 4 20 

166 gagatcgcag cacgcgtgtt gggtatttac gctcatatgg atcacaattt catgacggcg 480 

167 agtgtgccaa catttcgatt gaatttccat gtgagaagac tggtgaatgc aggatacaag 540 

168 attggtgtag tgaagcagac tgaaactgca gccattaagt cccatggtgc aaaccggacc 600 

16 9 ggcccttttt tccggggact gtcggcgttg tataccaaag ccacgcttga agcggctgag 660 

170 gatataagtg gtggttgtgg tggtgaagaa ggttttggtt cacagagtaa tttcttggtt 720 

171 tgtgttgtgg atgagagagt taagtcggag acattaggct gtggtattga aatgagtttt 780 

172 gatgttagag tcggtgttgt tggcgttgaa atttcgacag gtgaagttgt ttatgaagag 840 

173 ttcaatgata atttcatgag aagtggatta gaggctgtga ttttgagctt gtcaccagct 900 

174 gagctgttgc ttggccagcc tctttcacaa caaactgaga agtttttggt ggcacatgct 960 

175 ggacctacct caaacgttcg agtggaacgt gcctcactgg attgtttcag caatggtaat 1020 

176 gcagtagatg aggttatttc attatgtgaa aaaatcagcg caggtaactt agaagatgat 1080 

17 7 aaagaaatga agctggaggc tgctgaaaaa ggaatgtctt gcttgacagt tcatacaatt 114 0 

178 atgaacatgc cacatctgac tgttcaagcc ctcgccctaa cgttttgcca tctcaaacag 1200 

179 tttggatttg aaaggatcct ttaccaaggg gcctcatttc gctctttgtc 1250 
201 <210> SEQ ID NO: 15 
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202 <211> LENGTH: 2110 

203 <212> TYPE: DNA ^ 

204 <213> ORGANISM: Arabidopsis thaliana ecotype Columbia ^/30o7 A/^IaJI* 

205 <22 3> OTHER INFORMATION: Clone" 13 /VT ^T^ L 
Q^tS> 206 <400> SEQUENCE: 15 

207 gccacatctg actgttcaag ccctcgccct aacgttttgc catctcaaac agtttggatt 6 0 

208 tgaaaggatc ctttaccaag gggcctcatt tcgctctttg tcaagtaaca cagagatgac 120 

209 tctctcagcc aatactctgc aacagttgga ggttgtgaaa aataattcag atggatcgga 180 

210 atctggctcc ttattccata atatgaatca cacacttaca gtatatggtt ccaggcttct 240 

211 tagacactgg gtgactcatc ctctatgcga tagaaatttg atatctgctc ggcttgatgc 300 

212 tgtttctgag atttctgctt gcatgggatc tcatagttct tcccagctca gcagtgagtt 360 

213 ggttgaagaa ggttctgaga gagcaattgt atcacctgag ttttatctcg tgctctcctc 4 20 

214 agtcttgaca gctatgtcta gatcatctga tattcaacgt ggaataacaa gaatctttca 480 

215 tcggactgct aaagccacag agttcattgc agttatggaa gctattttac ttgcggggaa 540 

216 gcaaattcag cggcttggca taaagcaaga ctctgaaatg aggagtatgc aatctgcaac 6 00 

217 tgtgcgatct actcttttga gaaaattgat ttctgttatt tcatcccctg ttgtggttga 660 

218 caatgccgga aaacttctct ctgccctaaa taaggaagcg gctgttcgag gtgacttgct 7 20 

219 cgacatacta atcacttcca gcgaccaatt tcctgagctt gctgaagctc gccaagcagt 780 

220 tttagtcatc agggaaaagc tggattcctc gatagcttca tttcgcaaga agctcgctat 840 

221 tcgaaatttg gaatttcttc aagtgtcggg gatcacacat ttgatagagc tgcccgttga 900 

222 ttccaaggtc cctatgaatt gggtgaaagt aaatagcacc aagaagacta ttcgatatca 960 

223 tcccccagaa atagtagctg gcttggatga gctagctcta gcaactgaac atcttgccat 1020 

224 tgtgaaccga gcttcgtggg atagtttcct caagagtttc agtagatact acacagattt 1080 

225 taaggctgcc gttcaagctc ttgctgcact ggactgtttg cactcccttt caactctatc 1140 

226 tagaaacaag aactatgtcc gtcccgagtt tgtggatgac tgtgaaccag ttgagataaa 1200 

227 catacagtct ggtcgtcatc ctgtactgga gactatatta caagataact tcgtcccaaa 1260 

228 tgacacaatt ttgcatgcag aaggggaata ttgccaaatt atcaccggac ctaacatggg 1320 

229 aggaaagagc tgctatatcc gtcaagttgc tttaatttcc ataatggctc aggttggttc 1380 

230 ctttgtacca gcgtcattcg ccaagctgca cgtgcttgat ggtgttttca ctcggatggg 1440 

231 tgcttcagac agtatccagc atggcagaag tacctttcta gaagaattaa gtgaagcgtc 1500 

232 acacataatc agaacctgtt cttctcgttc gcttgttata ttagatgagc ttggaagagg 1560 

233 cactagcaca cacgacggtg tagccattgc ctatgcaaca ttacagcatc tcctagcaga 1620 

234 aaagagatgt ttggttcttt ttgtcacgca ttaccctgaa atagctgaga tcagtaacgg 1680 

235 attcccaggt tctgttggga cataccatgt ctcgtatctg acattgcaga aggataaagg 1740 

236 cagttatgat catgatgatg tgacctacct atataagctt gtgcgtggtc tttgcagcag 1800 

237 gagctttggt tttaaggttg ctcagcttgc ccagatacct ccatcatgta tacgtcgagc 1860 

238 catttcaatg gctgcaaaat tggaagctga ggtacgtgca agagagagaa atacacgcat 1920 

239 gggagaacca gaaggacatg aagaaccgag aggcgcagaa gaatctattt cggctctagg 1980 

240 tgacttgttt gcagacctga aatttgctct ctctgaagag gacccttgga aagcattcga 2040 

241 gtttttaaag catgcttgga agattgctgg caaaatcaga ctaaaaccaa cttgttcatt 2100 

242 ttgacccggg 2110 

483 <210> SEQ ID NO: 19 

484 <211> LENGTH: 1081 

485 <212> TYPE: PRT 
4 86 <213> ORGANISM: 
487 <223> OTHER INFORMATION: 

E--> 488 <400> SEQUENCE : 19 

489 Met Gly Lys Gin Lys Gin Gin Thr lie Ser Arg Phe Phe Ala Pro Lys 

490 15 10 15 



.J. 

[: Arabidopsis thaliana ecotype Columbia siJtjy^i DAAJf\J 
[FORMATION : Polypeptide MSH3 ^J^Mr*^ Jv'W' 
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Leu 


Asp 


Gly 


Val 


Phe 


Thr 


Arg 


Met 


Gly 


Ala 


Ser 


Asp 


596 




850 










855 










860 










597 


Ser 


He 


Gin 


His 


Gly Arg 


Ser 


Thr 


Phe 


Leu 


Glu 


Glu 


Leu 


Ser 


Glu 


Ala 


598 


865 










870 










875 










880 


599 


Ser 


His 


He 


He 


Arg 


Thr 


Cys 


Ser 


Ser 


Arg 


Ser 


Leu 


Val 


He 


Leu 


Asp 


600 










885 










890 










895 




601 


Glu 


Leu 


Gly 


Arg 


Gly 


Thr 


Ser 


Thr 


His 


Asp 


Gly 


Val 


Ala 


He 


Ala 


Tyr 


602 








900 










905 










910 






603 


Ala 


Thr 


Leu 


Gin 


His 


Leu 


Leu 


Ala 


Glu 


Lys 


Arg 


Cys 


Leu 


Val 


Leu 


Phe 


604 






915 










920 










925 








605 


Val 


Thr 


His 


Tyr 


Pro 


Glu 


He 


Ala 


Glu 


He 


Ser 


Asn 


Gly 


Phe 


Pro 


Gly 


606 




930 










935 










940 










607 


Ser 


Val 


Gly 


Thr 


Tyr 


His 


Val 


Ser 


Tyr 


Leu 


Thr 


Leu 


Gin 


Lys 


Asp 


Lys 


608 


945 








950 










955 










960 


609 


Gly 


Ser 


Tyr 


Asp 


His 


Asp 


Asp 


Val 


Thr 


Tyr 


Leu 


Tyr 


Lys 


Leu 


Val 


Arg 


610 








965 










970 










975 




611 


Gly 


Leu 


Cys 


Ser 


Arg 


Ser 


Phe 


Gly 


Phe 


Lys 


Val 


Ala 


Gin 


Leu 


Ala 


Gin 


612 








980 










985 










990 






613 


He 


Pro 


Pro 


Ser 


Cys 


He 


Arg 


Arg 


Ala 


He 


Ser 


Met 


Ala 


Ala 


Lys 


Leu 


614 






995 








1000 








1005 








615 


Glu 


Ala 


Glu 


Val 


Arg 


Ala 


Arg 


Glu 


Arg 


Asn 


Thr 


Arg 


Met 


Gly 


Glu 


Pro 


616 


1010 








1015 








1020 










617 


Glu 


Gly 


His 


Glu 


Glu 


Pro 


Arg 


Gly 


Ala 


Glu 


Glu 


Ser 


He 


Ser 


Ala 


Leu 


618 


1025 






1030 








1035 








1040 


619 


Gly Asp 


Leu 


Phe 


Ala 


Asp 


Leu 


Lys 


Phe 


Ala 


Leu 


Ser 


Glu 


Glu 


Asp 


Pro 


620 








1045 








1050 








1055 




621 


Trp 


Lys 


Ala 


Phe 


Glu 


Phe 


Leu 


Lys 


His 


Ala 


Trp 


Lys 


He 


Ala 


Gly 


Lys 


622 




1060 








1065 








1070 






623 


He 


Arg 


Leu 


Lys 


Pro 


Thr 


Cys 


Ser 


Phe 

















624 1075 1080 

685 <210> SEQ ID NO: 26 

686 <211> LENGTH: 2188 

687 <212> TYPE: DNA 

688 <213> ORGANISM: Arabidopsis thaliana ecotype Columbia 

689 <223> OTHER INFORMATION: Clone 43 
E--> 690 <400> SEQUENCE: 26 



6 91 cccgggatgc agcgccagag atcgattttg tctttcttcc aaaaacccac ggcggcgact 

6 92 acgaagggtt tggtttccgg cgatgctgct agcggcgggg gcggcagcgg aggaccacga 

6 93 tttaatgtga aggaagggga tgctaaaggc gacgcttctg tacgttttgc tgtttcgaaa 

694 tctgtcgatg aggttagagg aacggatact ccaccggaga aggttccgcg tcgtgtcctg 

695 ccgtctggat ttaagccggc tgaatccgcc ggtgatgctt cgtccctgtt ctccaatatt 
6 96 atgcataagt ttgtaaaagt cgatgatcga gattgttctg gagagaggag ccgagaagat 
697 gttgttccgc tgaatgattc atctctatgt atgaaggcta atgatgttat tcctcaattt 
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698 




rat"tfTaat"a 

N—r L- l*— V_ » U. C»- C- CL 


a t'Qcrtaaaa c 

U- l~ j3 U. t*. tJ- V— 


fcaacraaacra 


aapfatarf 1" 


ttacrtttr'acr 


tcrcroaaacipt 
y y y y y 


480 


699 




craacttacra t 

*J CLCL w C> UuVj CL C 


cacrtacjaacra 






ttrT'tcrcftrr 1 
^— i— ^ \^ ^ y y 




54 0 


700 






caccftcrcttc 

CL C»» ^ 13 '""* C» * 


tccrcttaaaa 

l_* S_h ^> L. 1— U u. U 


coacrttrtacr 


acrcratcjaaat 


aarttttaaa 


600 


701 




aaaaataaaa 


ttcctatatt 

\* *V C> V CL C* C* 


aoactctaac 


aaaaacrctaa 


aaatactcca 


crciatccQcrtt 

*J U *J ^» t> 


660 


702 




totooaaaaa 

C» ^-J C* ^-j CL>4 CL v ^ CL 


aaaaaoaaat 




^ rra t" i"1"cr 


^3 a t" crcf P+" t" aa 


at ftt ft f era 


720 


703 




at cacrcr crater 


ccaataaaao 

x^n^clcl c* cl u 


accrtcctcrat 


gatccccttt 


acgatagaaa 


caccttacac 


780 


704 




ataccaccta 


atattttcaa 


craaaatCTt ct 


gcatcacaaa 


agcaatattg 


aaatcrttaacr 


840 


705 




aataaatata 

cl^i clcl CL C»CL 


tacracattcrt 

C> *J U ^*CL C* C* C» 


qcttttcttt 


aaagtgggga 


aattttatga 


crr'tcrtatcracr 


900 


706 




p1~a era I - ctpctct 

v c-uy ci ^* y ^y y 


a a 1~1"a cfert" r*a 

CL CL d C» CL y y 1— w- CL 


Cri a crcrs apt* t 


gactggaaga 


tgaccatgag 


t CJ CI t CI t CT CJ CJ 3 


960 


707 




ciaa w cl y aw 






gggatagatg 


aggcagtgea 


UdCiy ^ u-CL 1— L-CL 


1090 

j. \j \j 


708 




cjctccrtcraa t 

y w C • y 1— y y a C 


ataaaottaa 


accrsatccracr 


cagctagaaa 


catctgacca 


o.y ^- ci- Q. a. a. ^ 


1080 

X V \J \J 


709 




aaaaatacta 

tt^-J CL ~1 ™1 C» C> CL 


atactataat 


tcfaaaaaaa 


ctagttcagg 


tattaactcc 


atraaraara 

CL LfWdlA^tLU^^L 


1140 


710 




rinro^ dd era ^ 
cty ^5 j y cl cl 






catcttcttg 


ctataaaaga 


^cxl^^clclclcl L- y 


X £t \J \J 


711 




oaoctacaaa 

y cl y w c» a v*- aaa 






tttgcttttg 


ttgactgtgc 


uy ^y c*y y 


1260 


712 




ttttaaatta 


aotccatcaa 


CCT3 tcra taca 


tcatgtgctg 


ctcttggagc 


crt 1 3 i" t era t" cr 

*H C C CL L- L-y CL C y 


1320 


713 




caggtttctc 


caaacrcraacrt 


attatataac 


agtaaagggc 


tatcaagaga 


aacacaaaao 

CL ~H 'w CL VUuH CL ^4 


1380 


714 




actctaaaaa 


aatatapcrt"t 


aar'acraCTi'r't 


aeggeggtae agttggctcc 


cl y uuv^a^aa 


1440 


715 




crtaatcrcrcrcf g 




tcrctcraacrt't" 


agaaatataa 


tagaatctaa 


r*crcia 1" af^tt 

y CL Cr CL \— • L L L 


1500 


716 




aaaaatt ft t" 


r'I'crast' r*3 , hcr 

V— * ^ U.U. V- V-* u. ^ \4 


craa ptcr1"crfl" 


gttgatggtc 


taaatgaatg 




1560 

_L w/ U 


71 7 








adLLadLCat 


ctgtctaggc 


taaagctaga 


clgaT.yLa.CuL 


icon 


71 
/ J. O 




aagcatgggg 


atatttttcc 


a ^ a a a rt"4" +* 

auaccaagtL 


tacaggggtt 


gtctcagaat 


uy a Lggccag 


i con 


71 Q 




acgatggtaa 


atcttgagat 


-f^ 4~ ^ z» ^ "3 "f - 


agctgtgatg 


gtggtccttc 


ay gg acct uy 


± / fi U 


720 




tacaaatatc 


ttgataactg 


tgttagtcca 


actggtaagc 


gactcttaag 


y ad. l Ly ya tu 


i ft no 


791 




tgccatccac 


tcaaagatgt 


agaaagcatc 


aataaaegge 


ttgatgtagt 


trta arts ^ +- +- /-i 
tyddydaLLC 


lODU 






aeggcaaact 


cagaaagtat 


gcaaatcact 


ggecagtate 


tccacaaact 


LLL-ay aL i_ Let. 


1 Q90 

1 JZU 


723 




gaaagactgc 


teggaegcat 


caagtctagc 


gttcgatcat 


cagcctctgt 


y l uy Ly u l. 


J_ J O 


794 




cttctgggga 


aaaaagtgct 


gaaacaacga 


gttaaagcat 


ttgggcaaat 


i— y uydci.ci.yyy 


904 0 


725 




ttcagaagtg 


gaattgatct 


gttgttggct 


ctacagaagg 


aatcaaatat 


y d i_ y ct y l. i_i_y 




726 




ctttataaac 


tctgtaaact 


tcctatatta 


gtaggaaaaa 


gegggctaga 


aftafttfl-1- 


91 fin 

^ -L D \J 


797 




tctcaattcg 


aagcagecat 


agatagcg 








91 OR 


729 


<210> 


SEQ ID NO: 


27 












730 


<211> 


LENGTH: 1385 












731 


<212> 


TYPE: DNA 














732 


<213> 


ORGANISM: Arabidopsis 


thaliana ecotype Columbia a, 


if 




733 


<223> 


OTHER INFORMATION: Clone 62 










Fl > 7-34 


N*t \J U r 


SEQUENCE: 27 












735 




catcagcctc 


tgtgttgcct 


gctcttctgg 


ggaaaaaagt 


gctgaaacaa 


j -I Ldaciy 




736 




catttgggca 


aattgtgaaa 


gggttcagaa 


gtggaattga 


tctgttgttg 


y ^ L- ^ Ldoay a 


1 90 

J- Jj W 


737 




aggaatcaaa 


tatgatgagt 


ttgetttata 


aactctgtaa 


acttcctata 


ttagtaggaa 


180 


738 




aaageggget 


agagttattt 


ctttctcaat 


tcgaagcagc 


catagatagc 


gactttccaa 


240 


739 




attatcagaa 


ccaagatgtg 


acagatgaaa 


aegctgaaac 


tctcacaata 


cttatcgaac 


300 


740 




tttttatcga 


aagagcaact 


caatggtctg 


aggtcattca 


caccataagc 


tgcctagatg 


360 


741 




tcctgagatc 


ttttgeaate 


gcagcaagtc 


tetctgetgg 


aagcatggcc 


aggcctgtta 


420 


742 




tttttcccga 


atcagaagct 


acagatcaga 


atcagaaaac 


aaaagggeca 


atacttaaaa 


480 


743 




tccaaggact 


atggcatcca 


tttgcagttg 


cagecgatgg 


teaattgect 


gttccgaatg 


54 0 


744 




atatactcct 


tggegagget 


agaagaagca 


gtggcagcat 


tcatcctcgg 


tcattgttac 


600 


745 




tgaegggace 


aaacatgggc 


ggaaaatcaa 


ctcttcttcg 


tgcaacatgt 


ctggccgtta 


660 


746 




tctttgccca 


acttggctgc 


tacgtgccgt 


gtgagtcttg 


cgaaatctcc 


ctcgtggata 


720 


747 




ctatcttcac 


aaggcttggc 


gcatctgata 


gaatcatgac 


aggagagagt 


acctttttgg 


780 
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748 tagaatgcac tgagacagcg tcagttcttc agaatgcaac tcaggattca ctagtaatcc 84 0 

749 ttgacgaact gggcagagga actagtactt tcgatggata cgccattgca tactcggttt 900 

750 ttcgtcacct ggtagagaaa gttcaatgtc ggatgctctt tgcaacacat taccaccctc 960 

751 tcaccaagga attcgcgtct cacccacgtg tcacctcgaa acacatggct tgcgcattca 1020 
7 52 aatcaagatc tgattatcaa ccacgtggtt gtgatcaaga cctagtgttc ttgtaccgtt 1080 
753 taaccgaggg agcttgtcct gagagctacg gacttcaagt ggcactcatg gctggaatac 1140 
7 54 caaaccaagt ggttgaaaca gcatcaggtg ctgctcaagc catgaagaga tcaattgggg 1200 

755 aaaacttcaa gtcaagtgag ctaagatctg agttctcaag tctgcatgaa gactggctca 1260 

756 agtcattggt gggtatttct cgagtcgccc acaacaatgc ccccattggc gaagatgact 1320 

757 acgacacttt gttttgctta tggcatgaga tcaaatcctc ttactgtgtt cccaaataac 1380 

758 ccggg 1385 



1005 


<210> 


SEQ 


ID NO: : 


31 


























1006 


<211> 


LENGTH : 


1109 


























1007 


<212> 


TYPE: PRT 
























jj 




1008 


<213> 


ORGANISM: Arabidopsis thaliana ecotype Columbia 










1009 


<223> 


OTHER INFORMATION: Polypeptide MSH6 








/ 








E--> 1010 


<400> 


SEQUENCE: 31 


























1011 




Met 


Gin 


Arg 


Gin 


Arg 


Ser 


He 


Leu 


Ser 


Phe 


Phe 


Gin 


Lys 


Pro 


Thr 


Ala 


1012 




1 








5 










10 










15 




1013 




Ala 


Thr 


Thr 


Lys 


Gly 


Leu 


Val 


Ser 


Gly 


Asp 


Ala 


Ala 


Ser 


Gly 


Gly 


Gly 


1014 










20 










25 










30 






1015 




Gly 


Ser 


Gly 


Gly 


Pro 


Arg 


Phe 


Asn 


Val 


Arg 


Glu 


Gly 


Asp 


Ala 


Lys 


Gly 


1016 








35 










40 










45 








1017 




Asp 


Ala 


Ser 


Val 


Arg 


Phe 


Ala 


Val 


Ser 


Lys 


Ser 


Val 


Asp 


Glu 


Val 


Arg 


1018 






50 










55 










60 










1019 




Gly 


Thr 


Asp 


Thr 


Pro 


Pro 


Glu 


Lys 


Val 


Pro 


Arg 


Arg 


Val 


Leu 


Pro 


Ser 


1020 




65 










70 










75 










80 


1021 




Gly 


Phe 


Lys 


Pro 


Ala 


Glu 


Ser 


Ala 


Gly 


Asp 


Ala 


Ser 


Ser 


Leu 


Phe 


Ser 


1022 












85 










90 










95 




1023 




Asn 


lie 


Met 


His 


Lys 


Phe 


Val 


Lys 


Val 


Asp 


Asp 


Arg 


Asp 


Cys 


Ser 


Gly 


1024 










100 










105 










110 






1025 




Glu 


Arg 


Ser 


Arg 


Glu 


Asp 


Val 


Val 


Pro 


Leu 


Asn 


Asp 


Ser 


Ser 


Leu 


Cys 


1026 








115 










120 










125 








1027 




Met 


Lys 


Ala 


Asn 


Asp 


Val 


He 


Pro 


Gin 


Phe 


Arg 


Ser 


Asn 


Asn 


Gly 


Lys 


1028 






130 










135 










140 










1029 




Thr 


Gin 


Glu 


Arg 


Asn 


His 


Ala 


Phe 


Ser 


Phe 


Ser 


Gly 


Arg 


Ala 


Glu 


Leu 


1030 




145 










150 










155 










160 


1031 




Arg 


Ser 


Val 


Glu 


Asp 


lie 


Gly 


Val 


Asp 


Gly 


Asp 


Val 


Pro 


Gly 


Pro 


Glu 


1032 












165 










170 










175 




1033 




Thr 


Pro 


Gly 


Met 


Arg 


Pro 


Arg 


Ala 


Ser 


Arg 


Leu 


Lys 


Arg 


Val 


Leu 


Glu 


1034 










180 










185 










190 






1035 




Asp 


Glu 


Met 


Thr 


Phe 


Lys 


Glu 


Asp 


Lys 


Val 


Pro 


Val 


Leu 


Asp 


Ser 


Asn 


1036 








195 










200 










205 








1037 




Lys 


Arg 


Leu 


Lys 


Met 


Leu 


Gin 


Asp 


Pro 


Val 


Cys 


Gly 


Glu 


Lys 


Lys 


Glu 


1038 






210 










215 










220 










1039 




Val 


Asn 


Glu 


Gly 


Thr 


Lys 


Phe 


Glu 


Trp 


Leu 


Glu 


Ser 


Ser 


Arg 


He 


Arg 


1040 




225 










230 










235 










240 


1041 




Asp 


Ala 


Asn 


Arg 


Arg 


Arg 


Pro 


Asp 


Asp 


Pro 


Leu 


Tyr 


Asp 


Arg 


Lys 


Thr 


1042 












245 










250 










255 





Use of n and/or Xaa lias been detected In the Sequence Listing. 

<. Review the Sequence Listing to insure a corresponding 
M explanation is presented in the <220> to <223> fields of 
\ / each sequence using n or Xaa. 
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1043 
1044 
1045 
1046 
1047 
1048 
1049 
1050 
1051 
1052 
1053 
1054 
1055 
1056 
1057 
1058 
1059 
1060 
1061 
1062 
1063 
1064 
1065 
1066 
1067 
1068 
1069 
1070 
1071 
1072 
1073 
1074 
1075 
1076 
1077 
1078 
1079 
1080 
1081 
1082 
1083 
1084 
1085 
1086 
1087 
1088 
1089 
1090 
1091 



Input Set : N:\Crf4\Refhold\l529239B.raw 
Output Set: N:\CRF4\10212002\I529239B.raw 

Leu His He Pro Pro Asp Val Phe Lys Lys Met Ser Ala Ser Gin Lys 

260 265 270 

Gin Tyr Trp Ser Val Lys Ser Glu Tyr Met Asp He Val Leu Phe Phe 

275 280 285 

Lys Val Gly Lys Phe Tyr Glu Leu Tyr Glu Leu Asp Ala Glu Leu Gly 

290 295 300 

His Lys Glu Leu Asp Trp Lys Met Thr Met Ser Gly Val Gly Lys Cys 
305 310 315 320 

Arg Gin Val Gly He Ser Glu Ser Gly He Asp Glu Ala Val Gin Lys 

325 330 335 

Leu Leu Ala Arg Gly Tyr Lys Val Gly Arg He Glu Gin Leu Glu Thr 

340 345 350 

Ser Asp Gin Ala Lys Ala Arg Gly Ala Asn Thr He He Pro Arg Lys 

355 360 365 

Leu Val Gin Val Leu Thr Pro Ser Thr Ala Ser Glu Gly Asn He Gly 

370 375 380 

Pro Asp Ala Val His Leu Leu Ala He Lys Glu He Lys Met Glu Leu 
385 390 395 400 

Gin Lys Cys Ser Thr Val Tyr Gly Phe Ala Phe Val Asp Cys Ala Ala 

405 410 415 

Leu Arg Phe Trp Val Gly Ser He Ser Asp Asp Ala Ser Cys Ala Ala 

420 425 430 

Leu Gly Ala Leu Leu Met Gin Val Ser Pro Lys Glu Val Leu Tyr Asp 

435 440 445 

Ser Lys Gly Leu Ser Arg Glu Ala Gin Lys Ala Leu Arg Lys Tyr Thr 

450 455 460 

Leu Thr Gly Ser Thr Ala Val Gin Leu Ala Pro Val Pro Gin Val Met 
465 470 475 480 

Gly Asp Thr Asp Ala Ala Gly Val Arg Asn He He Glu Ser Asn Gly 

485 490 495 

Tyr Phe Lys Gly Ser Ser Glu Ser Trp Asn Cys Ala Val Asp Gly Leu 

500 505 510 

Asn Glu Cys Asp Val Ala Leu Ser Ala Leu Gly Glu Leu He Asn His 

515 520 525 

Leu Ser Arg Leu Lys Leu Glu Asp Val Leu Lys His Gly Asp He Phe 

530 535 540 

Pro Tyr Gin Val Tyr Arg Gly Cys Leu Arg He Asp Gly Gin Thr Met 
545 550 555 560 

Val Asn Leu Glu He Phe Asn Asn Ser Cys Asp Gly Gly Pro Ser Gly 

565 570 575 

Thr Leu Tyr Lys Tyr Leu Asp Asn Cys Val Ser Pro Thr Gly Lys Arg 

580 585 590 

Leu Leu Arg Asn Trp He Cys His Pro Leu Lys Asp Val Glu Ser He 

595 600 605 

Asn Lys Arg Leu Asp Val Val Glu Glu Phe Thr Ala Asn Ser Glu Ser 

610 615 620 

Met Gin He Thr Gly Gin Tyr Leu His Lys Leu Pro Asp Leu Glu Arg 
625 630 635 640 

Leu Leu Gly Arg He Lys Ser Ser Val Arg Ser Ser Ala Ser Val Leu 
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1092 645 650 655 

1093 Pro Ala Leu Leu Gly Lys Lys Val Leu Lys Gin Arg Val Lys Ala Phe 

1094 660 665 670 

1095 Gly Gin lie Val Lys Gly Phe Arg Ser Gly lie Asp Leu Leu Leu Ala 

1096 675 680 685 

1097 Leu Gin Lys Glu Ser Asn Met Met Ser Leu Leu Tyr Lys Leu Cys Lys 

1098 690 695 700 

1099 Leu Pro lie Leu Val Gly Lys Ser Gly Leu Glu Leu Phe Leu Ser Gin 

1100 705 710 715 720 

1101 Phe Glu Ala Ala He Asp Ser Asp Phe Pro Asn Tyr Gin Asn Gin Asp 

1102 725 730 735 

1103 Val Thr Asp Glu Asn Ala Glu Thr Leu Thr He Leu He Glu Leu Phe 

1104 740 745 750 

1105 He Glu Arg Ala Thr Gin Trp Ser Glu Val He His Thr He Ser Cys 

1106 755 760 765 

1107 Leu Asp Val Leu Arg Ser Phe Ala He Ala Ala Ser Leu Ser Ala Gly 

1108 770 775 780 

1109 Ser Met Ala Arg Pro Val He Phe Pro Glu Ser Glu Ala Thr Asp Gin 

1110 785 790 795 800 

1111 Asn Gin Lys Thr Lys Gly Pro He Leu Lys He Gin Gly Leu Trp His 

1112 805 810 815 

1113 Pro Phe Ala Val Ala Ala Asp Gly Gin Leu Pro Val Pro Asn Asp He 

1114 820 825 830 

1115 Leu Leu Gly Glu Ala Arg Arg Ser Ser Gly Ser He His Pro Arg Ser 

1116 835 840 845 

1117 Leu Leu Leu Thr Gly Pro Asn Met Gly Gly Lys Ser Thr Leu Leu Arg 

1118 850 855 860 

1119 Ala Thr Cys Leu Ala Val He Phe Ala Gin Leu Gly Cys Tyr Val Pro 

1120 865 870 875 880 

1121 Cys Glu Ser Cys Glu He Ser Leu Val Asp Thr He Phe Thr Arg Leu 

1122 885 890 895 

1123 Gly Ala Ser Asp Arg He Met Thr Gly Glu Ser Thr Phe Leu Val Glu 

1124 900 905 910 

1125 Cys Thr Glu Thr Ala Ser Val Leu Gin Asn Ala Thr Gin Asp Ser Leu 

1126 915 920 925 

1127 Val He Leu Asp Glu Leu Gly Arg Gly Thr Ser Thr Phe Asp Gly Tyr 

1128 930 935 940 

1129 Ala He Ala Tyr Ser Val Phe Arg His Leu Val Glu Lys Val Gin Cys 

1130 945 950 955 960 

1131 Arg Met Leu Phe Ala Thr His Tyr His Pro Leu Thr Lys Glu Phe Ala 

1132 965 970 975 

1133 Ser His Pro Arg Val Thr Ser Lys His Met Ala Cys Ala Phe Lys Ser 

1134 980 985 990 

1135 Arg Ser Asp Tyr Gin Pro Arg Gly Cys Asp Gin Asp Leu Val Phe Leu 

1136 995 1000 1005 

1137 Tyr Arg Leu Thr Glu Gly Ala Cys Pro Glu Ser Tyr Gly Leu Gin Val 

1138 1010 1015 1020 

1139 Ala Leu Met Ala Gly He Pro Asn Gin Val Val Glu Thr Ala Ser Gly 

1140 1025 1030 1035 1040 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/529 , 239B 



DATE: 10/21/2002 
TIME: 18:20:2,4 



114-1 
1142 
1143 
1144 
1145 
1146 
1147 
1148 
1149 
1150 



Input Set : N:\Crf4\Refhold\I529239B.raw 
Output Set: N:\CRF4\10212002\I529239B.raw 

Ala Ala Gin Ala Met Lys Arg Ser He Gly Glu Asn Phe Lys Ser Ser 

1045 1050 1055 

Glu Leu Arg Ser Glu Phe Ser Ser Leu His Glu Asp Trp Leu Lys Ser 

1060 1065 1070 

Leu Val Gly He Ser Arg Val Ala His Asn Asn Ala Pro He Gly Glu 

1075 1080 1085 

Asp Asp Tyr Asp Thr Leu Phe Cys Leu Trp His Glu He Lys Ser Ser 

1090 1095 1100 

Tyr Cys Val Pro Lys 
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RAW SEQUENCE LISTING ERROR SUMMARY DATE: 10/21/2002 

PATENT, APPLICATION: US/09/529 , 2 39B TIME: 18:20:25 

Input Set : N:\Crf4\Refhold\l529239B.raw 
Output Set: N:\CRF4\10212002\l529239B.raw 

Invalid Line Length: 

The rules require that a line not exceed 72 characters in length. This includes spaces. 

Seq#:l; Line(s) 30 

Seq#:2; Line(s) 53 

Seq#:3; Line(s) 68 

Seq#:4; Line(s) 78 

Seq#:5; Line(s) 88 

Seq#:6; Line(s) 98 

Seq#:7; Line(s) 108 

Seq#:8; Line(s) 118 

Seq#:9; Line(s) 128 

Seq#:10; Line(s) 138 

Seq#:ll; Line(s) 148 

Seq#:12; Line(s) 175,176,177,178 

Seq#:13; Line(s) 186 

Seq#:14; Line(s) 196 

Seq#:15; Line(s) 223,224,225,226,227,228,229,230,231,232,233,234,235,236 

Seq#:15; Line(s) 237,238,239,24 0,241 

Seq#:16; Line(s) 249 

Seq#:17; Line(s) 2 59 

Seq#:18; Line(s) 271,479,480 

Seq#:20; Line(s) 631 

Seq#:21; Line(s) 641 

Seq#:22; Line(s) 650 

Seq#:23; Line(s) 660 

Seq#:24; Line(s) 670 

Seq#:25; Line{s) 680 

Seq#:26; Line(s) 707,708,709,710,711,712,713,714,715,716,717,718,719,720 

Seq#:26; Line(s) 721,722,723,724,725,726,727 

Seq#:27; Line(s) 7 51,752,753,754,755,756,757 

Seq#:28; Line(s) 765 

Seq#:29; Line(s) 775 

Seq#:30; Line(s) 787,1001,1002 

Seq#:32; Line(s) 1157 

Seq#:33; Line(s) 1166 

Seq#:34; Line(s) 1175 

Seq#:35; Line(s) 1185 

Seq#:36; Line(s) 1195 

Seq#:37; Line(s) 1205 

Seq#:38; Line(s) 1215 

Seq#:39; Line(s) 1225 

Seq#:40; Line(s) 1235 

Seq#:41; Line(s) 1245 

Seq#:42; Line(s) 1255 

Seq#:43; Line(s) 1265 
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RAW SEQUENCE LISTING ERROR SUMMARY DATE: 10/21/2002 

PATENT. APPLICATION: US/09/529 , 239B TIME: 18:20:25 

Input Set : N:\Crf4\Refhold\I529239B.raw 
Output Set: N:\CRF4\10212002\I529239B.raw 
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Seg# :93; 
Seqtt : 94; 
Seq#:95; 
Seq#:96; 
Seq# :97; 
Seq# :98; 
Seq# :98; 
Seq# :98; 
Seq# :98; 
Seq# :98; 
Seq# :98; 
Seq# :98; 
Seq# :98; 
Seq# :98; 
Seq# ;98; 
Seq# :98; 



Line(s 
Line(s 
Line(s 
Line(s 
Line(s 
Line( s 
Line(s 
Line(s 
Line(s 
Line( s 
Line{s 
Line( s 
Line (s 
Line(s 
Line(s 
Line( s 



1765 

1775 

1785 

1795 

1805 

1833, 

1844, 

1855, 

1866, 

1877, 

1888, 

1899, 

1910, 

1921, 

1932, 

1943, 



1834, 
1845, 
1856, 
1867, 
1878, 
1889, 
1900, 
1911, 
1922, 
1933, 
1944, 



1835, 
1846, 
1857, 
1868, 
1879, 
1890, 
1901, 
1912, 
1923, 
1934, 
1945, 



1836, 
1847, 
1858, 
1869, 
1880, 
1891, 
1902, 
1913, 
1924, 
1935, 
1946, 



1837, 
1848, 
1859, 
1870, 
1881, 
1892, 
1903, 
1914, 
1925, 
1936, 
1947, 



1838, 
1849, 
1860, 
1871, 
1882, 
1893, 
1904, 
1915, 
1926, 
1937, 
1948, 



1839, 
1850, 
1861, 
1872, 
1883, 
1894, 
1905, 
1916, 
1927, 
1938, 
1949, 



1840, 
1851, 
1862, 
1873, 
1884, 
1895, 
1906, 
1917, 
1928, 
1939, 
1950 



1841, 
1852, 
1863, 
1874, 
1885, 
1896, 
1907, 
1918, 
1929, 
1940, 



1842,1843 
1853,1854 
1864,1865 
1875,1876 
1886,1887 
1897,1898 
1908,1909 
1919,1920 
1930,1931 
1941,1942 
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VERIFICATION SUMMARY 

PATENT" APPLICATION : US/09/529 , 239B 



DATE: 10/21/2002 
TIME: 18:20:25 



Input Set : N:\Crf4\Refhold\I529239B.raw 
Output Set: N:\CRF4\10212002\I529239B.raw 



L:7 M:270 C: Current Application Number differs , Wrong Format 
L:38 M:341 W: (46) B n n or "Xaa" used, for SEQ ID# : 1 after pos . : 0 
L:61 M.-341 W: (46) "n" or "Xaa" used, for SEQ ID#:2 after pos.:0 

L:158 M:200 E: Mandatory Header Field missing, <220> Tag not found for SEQ ID#:12 
L:206 M:200 E: Mandatory Header Field missing, <220> Tag not found for SEQ ID#:15 
L:488 M:200 E: Mandatory Header Field missing, <220> Tag not found for SEQ ID#:19 
L:690 M:200 E: Mandatory Header Field missing, <220> Tag not found for SEQ ID#:26 
L:734 M:200 E: Mandatory Header Field missing, <220> Tag not found for SEQ ID# : 27 
L:1010 M:200 E: Mandatory Header Field missing, <220> Tag not found for SEQ ID# : 31 
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